Video Shot Boundary Detection using Visual Bag-of-Words
Authors
Abstract
Recently, techniques used in image analysis and video processing have converged. Many computation- and memory-intensive image analysis methods have become practical for per-frame processing of videos, thanks to the increased computing power of desktop computers and efficient implementations on multiple cores and graphics processing units (GPUs). As our main contribution in this work, we address the problem of shot boundary detection using a popular image analysis (object detection) approach: visual bag-of-words (BoW). The baseline approach for shot boundary detection has been the colour histogram, which is at the core of many top methods, but our BoW method, of similar complexity in terms of parameters, clearly outperforms colour histograms. Interestingly, an “AND-combination” of colour and BoW histogram detection is clearly superior, indicating that colour and local features provide complementary information for video analysis.
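The "AND-combination" mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that per-frame colour histograms and BoW histograms have already been computed, and the L1 distance, the threshold values, and the function names (`histogram_distances`, `detect_boundaries`) are illustrative assumptions that do not come from the abstract.

```python
import numpy as np


def histogram_distances(hists):
    """L1 distance between the normalised histograms of consecutive frames.

    The choice of L1 distance is an assumption made for illustration.
    """
    h = np.asarray(hists, dtype=float)
    # Normalise each frame histogram to sum to 1 (guarding against empty ones).
    h = h / np.maximum(h.sum(axis=1, keepdims=True), 1e-12)
    return np.abs(h[1:] - h[:-1]).sum(axis=1)


def detect_boundaries(colour_hists, bow_hists, colour_thr=0.5, bow_thr=0.5):
    """Return indices of frames that start a new shot.

    A boundary is declared only when BOTH the colour and the BoW histogram
    distances exceed their (hypothetical) thresholds: the AND-combination.
    """
    d_colour = histogram_distances(colour_hists)
    d_bow = histogram_distances(bow_hists)
    is_cut = (d_colour > colour_thr) & (d_bow > bow_thr)
    # Distance i compares frame i with frame i + 1, so the new shot
    # starts at frame i + 1.
    return (np.flatnonzero(is_cut) + 1).tolist()


if __name__ == "__main__":
    # Toy data: four frames, with a change in both cues between frames 1 and 2
    # and a change in the BoW cue only between frames 2 and 3.
    colour = [[10, 0], [10, 0], [0, 10], [0, 10]]
    bow = [[5, 0, 0], [5, 0, 0], [0, 5, 0], [0, 0, 5]]
    print(detect_boundaries(colour, bow))
```

In this toy input, the BoW-only change between frames 2 and 3 is not reported, because the colour histogram does not change there. Requiring agreement between the two cues is one way complementary features could suppress false detections.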
Similar resources
MIC-TJU at MediaEval Violent Scenes Detection (VSD) 2014
The task of Violent Scenes Detection requires creating a system to detect segments which contain physical violence in both movies and videos found on the web, which is a very challenging task due to camera jitters in hand-shot videos and free shot boundary in movies and web videos. In this paper, we present a novel system by combining shot boundary detection, feature extraction in both audio an...
Shot Boundary Detection Using Shifting of Image Frame
Detection of shot boundary is the key step for identification of visual content of the video data. In this paper we propose a method of shot boundary detection with a novel logic of frame comparison. In our method, current frame of the video data is compared with the shifted version of the previous frame of that video and by using suitable threshold, shot boundary is declared. The experimental ...
Egocentric Activity Recognition Using Bag of Visual Words
This paper presents an approach for recognizing activities using video from the egocentric setup. In this approach instead of using intermediate setup like object detection, pose estimation, modeling spatial distribution of visual words is implemented. The interactions are encoded by using Histogram oriented Pairwise Relation named (HOPR) between the visual words, orientations and alignments. A...
Shot-boundary detection: unraveled and resolved?
Partitioning a video sequence into shots is the first step toward video-content analysis and content-based video browsing and retrieval. A video shot is defined as a series of interrelated consecutive frames taken contiguously by a single camera and representing a continuous action in time and space. As such, shots are considered to be the primitives for higher level content analysis, indexing,...
Video Shot Boundary Detection Using Various Techniques
Shot and classification is the first and foremost step for further analysis of video content. A shot is defined as a set of frames from a single camera. Processing of video and image facilitates better understanding of the scene that it describes. It is a fundamental component of a number of technologies like video surveillance, robotics, etc. A scene is a collection of one or more shots focusing on...